CDS
Accession Number | TCMCG078C16169 |
gbkey | CDS |
Protein Id | KAG0477178.1 |
Location | complement(join(43092169..43092293,43096758..43096827,43096913..43097035,43097118..43097225,43097695..43097902,43097985..43098411,43098546..43098751,43098922..43099149,43100071..43101410,43110708..43110857)) |
Organism | Vanilla planifolia |
locus_tag | HPP92_014019 |
Protein
Length | 994aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA633886, BioSample:SAMN14973820 |
db_source | JADCNL010000006.1 |
Definition | hypothetical protein HPP92_014019 [Vanilla planifolia] |
Locus_tag | HPP92_014019 |
EGGNOG-MAPPER Annotation
COG_category | G |
Description | Belongs to the glycosyltransferase 8 family |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | R05191 |
KEGG_rclass | RC00005 |
BRITE | ko00000, ko00001, ko01000, ko01003 |
KEGG_ko | ko:K13648 |
EC | 2.4.1.43 |
KEGG_Pathway | ko00520, map00520 |
GOs | - |
Sequence
CDS: ATGAGGCGGAGGGCGTTGGAATGGTGGAGATGGACGCCGCTTAGGCTCGTGGATTGGATCTGGTCGATTCTTGGTGTCTTCCTCGTCGCGGTTCTTGTCCTCTTCGTCGTGCAGCACCATCACCTCATCCCTCGCCAGCTGCCAATGCAGGTCAAAGGCACAGAGTTTGAAGCAATTCAAGCTGAGAAGTTGAACTTTACAGAAGAACTGTTGAGTAGCACATCATTTGCCAGGCAATTGGTTGATCAGGTCTCCCTAGCAAAAGCTTACTTAGTTCTTGCCAAGGAGCATGGTAACCTTGATTTTGCTTGGGAGCTTAGCTCACATATTAGAAACTGCCAAATATTGCTTTCTCAGGCGGCAATGAGTGGGAAGCGCATTACATTTGAGGAAGCCCATCCTGTTGTTCTTCAGCTCGCAAAGTCCATCTACAAAGCCCAAGATTACCACTATGATATCAGCACCAGTATTACAACTCTAAAGAAACATGCACAAGCTCTGGAGGAGCGTGCCATTGCCGCAACAGCACAGAGTGCAGCATTTGGCAGATTGGCTGTCAACTCCTTGCCAAAGAATCTCCGGTGTGTGAATGTCAAACTCATAACAGATTGGTTTGAAGACCCTAAACTCAAACAGCGTGCGGAAGAGCTGAAGAACTCCCTCCGGTTGACAGACATCAACCTATACCATTTCTGTCTCTTCTCAGATAATGTTCTGGCGACTTCAGTTGTGGTGAATTCTACCATTGCAAACGTAAAGCATCCACTACAGCTTGTCTTCCATGTGGTTACCAACAGCATCAGTTACAAAGCAATGGCTACCTGGTTCTTGAAGAATGACTTGAAAGGGTGCACAGTTTTGGTGAGAAGCGTCGAGGAGTTGTCCTGGTTGAATGAACCCTTCTCACCAGTGTTTGAACATCTGGCAAGAGCGGGAAAGGGAAGTTGGGATATGGGTTCACCCTCAATACTTGAATACCTGCGATTCTACATCCCAATGCTTCATCCATCTCTGGAGAGGATTGTGATTCTTGATGAAGACATTGTGGTTCAAAAAGATCTGACTCCTCTCTTCTCCCAGAACATGCATGGAAGTGTCATAGCGGCCGTGGAGACTTGCCTCGAGTCGTCCCATCGGCTTTACCATTATGTCAACTTTTCTCATCCTCTTATCAGCTCGACCTTCGATCCCCAGGTCTGTGGCTGGGCATTTGGGCTAAATGTGGTGGACCTTATAGCATGGAGGAAGTCAGATGTCACTGCCAGGTTTCATTACTGGTTGAAGCAGAATGCAGATCAAACCCTATGGAGGGATGGGATTTTGCCAGCAGGTCTCCTGGCATTTTATGGACTAATGGTTCCTCTTGATAGGAGATGGCATGTTCTTGGCTTGGGATACGACATGGAACTTGATGATAGGTTGATAGGAAGTGCAGCCAGCTTACACTTTAATGGCAATATGAAACCATGGCTGAAGTTGGCAATCAGCAGAAGGATGTGGAACCGGAAGAGAAGGCGGGAAGATCCGCCGTCAATCCATCCTCGTAACCGTTACGCTGATGAGCCGCCAGACTTTGGTCTCCTTGCCTCCCTGTACCCTTCCTTCAAGCAATTCGTCTTCTCCTCCCGTTCGGGCCGCCCCGCAATAGACTGGAAAGACTACAATGCCACCCGCGAGCTCACTCGCGTACTCCTCCTCCACGACCATGGCATCAATTGGTGGATTCCTGATGGCCAACTTTGCCCAACGGTGCCAAATCGTTTAAACTACATTCACTGGATTGATGACCTGCTATCTTCTGACCTCATCCCTAAAAGACAGACTTCGAATAACAAAGTCAAAGGCTTTGATATCGGCACTGGGGCTAACTGCATATACCCGCTCCTCGGTGCATCTTTACTTGGTTGGGAGTTTGTTGGCTCAGATGTCACAAAAGTAGCCCTTGAATGGGCTACAAAAAATGTTGAGAGCAACCCTAAGCTATTGGAACTCATC
AAGATTAGGGATGCTACTGATCCATTTAGTTGTAGTGATGCTACTCAGAGTACAAGGGAGCTCGTTAGTGAGCTTCCTTCAAAATTGTTTTTTGTAGAGAAGGATGAGTCCCAAGGTCAAGAGCTGAAGGAGTGTGGAACTGTGCAACCGCCTGTACTTGTGGGTGTTGTTAAAGAAGGCGAAACTTTTGACTTTTGTATATGTAACCCTCCATTTTTTGAGAGCATTGAGGAAGCAGGTCTCAACCCGAAGACATCATGTGGTGGAACAACTGAAGAGATGGTTTGCCCTGGTGGAGAAATAACTTTTGTTACACAGATCATCAAGGATAGTGTTGTCCTCAAGTGTTCATTTAGGTGGTTCACAGTAATGATTGGGAGAAAGATTAACTTAAAAAGTCTAATGTCAAAGCTACGTGAAGTTGGAGTGTCTATAGTCAAAACTACGGAGTTTGTCCAGGGTCGTACAGCTCGATGGGGGCTTGCTTGGAGTTTCATGCCACCATGCAAGGACTTCATTTCATCTACTGTAGCTTTGAAAAGCCATTGTTCATTTACACTTGAGGGCCTGAACCGCCAATGTGGTGCATTTCAAGTCTTAAAAGCAGTGGAATCATTTTTCTTAGACAAGGGTGTTCCTTGTAAAATCGACTCTTCATCCTTCTGTATCAATGTAAATTTAAACAATGTGCAAGATAACACGGCAAATGAAATGGGCTTGAGTGATTTGTTAAAAGATGCTGAAAATCACTCCACAAAAGTATCAAATGGATCATCGTGTGCAGCACTTGTTTCGGTATTTGAACAAATTCCTGGTACAATCCTAGTCAGGTGTTCGCCATTTGGAAAAGATGGAACAGTTTCAGGACTATTGTCTTCCCTATTCATCCATTTGGAGGAACACCTCCGAAAGGAGTTCAGCGGCAAGTCCCACGGTTCACTTCATAAACAAGAATCTAAGAAACCATCGCTTGATGAGACATCACATTAG |
Protein: MRRRALEWWRWTPLRLVDWIWSILGVFLVAVLVLFVVQHHHLIPRQLPMQVKGTEFEAIQAEKLNFTEELLSSTSFARQLVDQVSLAKAYLVLAKEHGNLDFAWELSSHIRNCQILLSQAAMSGKRITFEEAHPVVLQLAKSIYKAQDYHYDISTSITTLKKHAQALEERAIAATAQSAAFGRLAVNSLPKNLRCVNVKLITDWFEDPKLKQRAEELKNSLRLTDINLYHFCLFSDNVLATSVVVNSTIANVKHPLQLVFHVVTNSISYKAMATWFLKNDLKGCTVLVRSVEELSWLNEPFSPVFEHLARAGKGSWDMGSPSILEYLRFYIPMLHPSLERIVILDEDIVVQKDLTPLFSQNMHGSVIAAVETCLESSHRLYHYVNFSHPLISSTFDPQVCGWAFGLNVVDLIAWRKSDVTARFHYWLKQNADQTLWRDGILPAGLLAFYGLMVPLDRRWHVLGLGYDMELDDRLIGSAASLHFNGNMKPWLKLAISRRMWNRKRRREDPPSIHPRNRYADEPPDFGLLASLYPSFKQFVFSSRSGRPAIDWKDYNATRELTRVLLLHDHGINWWIPDGQLCPTVPNRLNYIHWIDDLLSSDLIPKRQTSNNKVKGFDIGTGANCIYPLLGASLLGWEFVGSDVTKVALEWATKNVESNPKLLELIKIRDATDPFSCSDATQSTRELVSELPSKLFFVEKDESQGQELKECGTVQPPVLVGVVKEGETFDFCICNPPFFESIEEAGLNPKTSCGGTTEEMVCPGGEITFVTQIIKDSVVLKCSFRWFTVMIGRKINLKSLMSKLREVGVSIVKTTEFVQGRTARWGLAWSFMPPCKDFISSTVALKSHCSFTLEGLNRQCGAFQVLKAVESFFLDKGVPCKIDSSSFCINVNLNNVQDNTANEMGLSDLLKDAENHSTKVSNGSSCAALVSVFEQIPGTILVRCSPFGKDGTVSGLLSSLFIHLEEHLRKEFSGKSHGSLHKQESKKPSLDETSH |
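
The Location field of the CDS entry lists ten exon intervals on the complement strand, and the Protein entry reports a length of 994 aa. As a consistency check, the summed exon lengths should equal (994 + 1) × 3 nucleotides, the extra codon being the stop. The sketch below is a minimal illustration, not part of the record: the helper names `exon_intervals` and `cds_length` are hypothetical, and it assumes every interval is a simple inclusive `start..end` pair with no partial-location markers such as `<` or `>`.

```python
import re

# Location string copied from the CDS entry of this record.
LOCATION = (
    "complement(join(43092169..43092293,43096758..43096827,"
    "43096913..43097035,43097118..43097225,43097695..43097902,"
    "43097985..43098411,43098546..43098751,43098922..43099149,"
    "43100071..43101410,43110708..43110857))"
)

# Protein length reported in the Protein entry (994aa).
PROTEIN_LENGTH_AA = 994


def exon_intervals(location):
    """Return (start, end) pairs for every 'start..end' span in the string.

    Hypothetical helper: assumes simple inclusive intervals only.
    """
    return [(int(s), int(e)) for s, e in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(location):
    """Sum the inclusive lengths of all exon intervals, in nucleotides."""
    return sum(end - start + 1 for start, end in exon_intervals(location))


exons = exon_intervals(LOCATION)
total_nt = cds_length(LOCATION)

print(f"exons: {len(exons)}")          # exons: 10
print(f"CDS length: {total_nt} nt")    # CDS length: 2985 nt

# 994 codons for the protein plus one stop codon.
print(total_nt == (PROTEIN_LENGTH_AA + 1) * 3)  # True
```

Strand orientation does not affect the length, so `complement(...)` is ignored here. It would matter only when extracting and reverse-complementing the sequence itself.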